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ABSTRACT 

The past few years has witnessed the great success of recom- 
mender systems, which can significantly help users find rel- 
evant and interesting items for them in the information era. 
However, a vast class of researches in this area mainly focus 
on predicting missing links in bipartite user-item networks 
(represented as behavioral networks). Comparatively, the 
social impact, especially the network structure based prop- 
erties, is relatively lack of study. In this paper, we firstly 
obtain five corresponding network-based features, including 
user activity, average neighbors' degree, clustering coeffi- 
cient, assortative coefficient and discrimination, from social 
and behavioral networks, respectively. A hybrid algorithm 
is proposed to integrate those features from two respective 
networks. Subsequently, we employ a machine learning pro- 
cess to use those features to provide recommendation results 
in a binary classifier method. Experimental results on a real 
dataset, Flixster, suggest that the proposed method can sig- 
nificantly enhance the algorithmic accuracy. In addition, as 
network-based properties consider not only the social activ- 
ities, but also take into account user preferences in the be- 
havioral networks, therefore, it performs much better than 
that from either social or behavioral networks. Furthermore, 
since the features based on the behavioral network contain 
more diverse and meaningfully structural information, they 
play a vital role in uncovering users' potential preference, 
which, might show light in deeply understanding the struc- 
ture and function of the social and behavioral networks. 
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I. INTRODUCTION 

Network sciences have provided us powerful and versa- 
tile tools to well recognize and understand various systems, 
ranging from economic systems to human society, from com- 
puter sciences to biology, and so on [?, ?]. There is a vast 
class of works in this area mainly focusing on independently 
studying the Social Networks and the Behavioral Networks. 
Social Networks, composed of identical individuals, describe 
the properties of essential components and various kinds of 
interactions among them [?]. Behavioral networks, on the 
other hand, consisted of two different components, users and 
items, involve not only the user profiles and item properties, 
but also contain user-item behavioral records. Consequently, 
they constitute a so-called Bipartite Graphs, on which re- 
searchers have devoted much effort to predict user potential 
preferences via mining the binary relations [?]. However, 
although social networks have been realized to play impor- 
tant roles in uncovering user hidden interests, the majority 
of works on this topic mainly concentrated on making use of 
the trustworthiness among individuals to make better rec- 
ommendation results [?, ?]. During the last decade, with the 
rapid growth and wide application of Social Network Ser- 
vices (SNS) (e.g. Twitter.com, Facehook.com, Weibo.com, 
etc.), users in those platforms can not only keep social con- 
tacts with friends, but also can share common interests, such 



as movies, stories, music and so on. That is to say, such so- 
cial activities can benefit individuals in effectively finding 
more interesting items. Furthermore, those counnon inter- 
ests/behaviors can conversely enhance the social relation- 
ships and help users building new relationships with conge- 
nial friends. Therefore, there is huge overlap between social 
and behavioral networks, in which the users play as coupling 
nodes, which drive the interdependent [?] or interconnected 
[?] properties and related functions. 

Recently, Recommender Systems, aiming at helping users 
find relevant and interesting items from the information era 
for them. The most widely adopted framework is the clas- 
sical Collaborative Filtering (CF) [?, ?], which provides the 
recommendation by taking into account effects of the most 
similar users/items. Nowadays, many pioneering works have 
tried to apply social factors into the CF framework [?, ?, 
?]. However, most of them focused on estimating the simi- 
larities among users, respectively from social and behaviors, 
and then hybriding the two metrics in order to obtain better 
recommendation performance. Despite its effectiveness, the 
network structure, which contains rich information of users' 
social and behavioral preferences, is relatively lack of atten- 
tion. Actually, such network structure has common features 
for both social and behavioral networks, e.g. active users 
arc more likely to make acquittance with more friends, as 
well as collecting more items. And similar users attend to 
like similar movies. All these aspects can be described and 
analyzed by network structure properties. In this paper, 
we discuss this issue via building a machine learning model. 
Basically, the research problem that we are interested in, is 
related to following points: 

• How we simultaneously define the corresponding struc- 
tural features from social and behavioral networks? 

• With a binary classifier, how wc evaluate the corre- 
lations and differences between social and behavioral 
networks? 

We firstly obtain five corresponding network-based fea- 
tures, including user activity, average neighbors' degree, clus- 
tering coefficient, assortative coefficient and discrimination, 
from social and behavioral networks, respectively. We then 
employ a machine learning process to use those features to 
provide recommendation results in a binary classifier method 
with ensemble learning. Experimental results on a real dataset, 
Flixster, suggest that it can significantly enhance the algo- 
rithmic accuracy. 

This paper is organized as follows. Section 2 is about 
the related works on coupling networks and recommender 
systems. In Section 3, we describe the extracted features 
and the proposed model. Section 4 describes the data and 
shows experimental results. In Section 5, we conclude our 
work and discuss the future works. 

2. RELATED WORK 

2.1 The Coupling Networks 

A large number of measures have been proposed to charac- 
terize network properties. Examples include the clustering 
[?], network modularity [?], and degree correlation [?], etc. 
These measures demonstrate that real networks indeed in- 
volve meaningful local structures (e.g. Network motifs [?]), 
and provide useful information for various applications. 



The coupling networks, also known as interdependent net- 
works, normally contain a two-layer network, such as elec- 
tricity and Internet networks [?], airport and railway net- 
works [?]. There is a kind of coupling nodes, such as cities in 
the two aforementioned networks, which play the intercon- 
nection and maintenance roles between these two-layer net- 
works. Consequently, those nodes are critically important 
for the robustness of whole networks [?]. Social networks, 
similar with the interdependent networks, also involve such 
coupling nodes, users, who both make firiends and collect 
items. Therefore, those users are especially vital for main- 
taining the structure, connectivity and robustness of social 
and behavioral networks. 

2.2 Social Recommender Systems 

The past few years have witnessed the great success of rec- 
ommender systems, which can significantly help users find 
relevant and interesting items for them in the information 
era [?]. Recently, many efforts have been devoted to study 
social networks to explore collaborative interactions or infiu- 
ences in order to produce more reliable recommendations [?] . 
[?] suggested a mixture between social infiuence and simi- 
larity of activities to predict future behaviors. [?] presented 
a measurement analysis of various online social networks by 
studying the corresponding topological properties, and then 
make recommendations. Collaborative filtering based on so- 
cial networks, wherein user-selected like-minded alters are 
were to make predictions, outperformed traditional user- 
based CF in predictive accuracy [?]. [?] proposed a new 
behavioral network based CF, exploited navigational pat- 
terns and transitive links to model users, analyzed behavior 
similarities, and eventually explored missing links. 

In addition. Machine Learning has been proved to be a 
very useful tool in enhancing the recommendation accuracy 
[?]. [?] described several algorithms to use user preferences 
to predict additional topics, including techniques based on 
correlation coefficients, vector-based similarity calculations, 
and statistical Bayesian methods. [?] solved the prob- 
lem of seeking new jobs with a supervised machine learning 
method. [?] proposed to combine rule patterns and super- 
vised learning to extract semantic music information. 

However and to the best of our knowledge, there is no 
previous work on using machining learning process to make 
recommendations from coupling network structure features, 
which is the main contribution of this paper and will be 
presented next in detail. 

3. FEATURES AND MODEL 

For the observed social network, we firstly extract the 
largest connected sub-graph, in which every node represents 
a user, and each edge is undirected. Then, we refine the be- 
havioral network to guarantee that the two networks have 
the same group of users. Finally, we make an interdependent 
network consisted of this social and behavioral network, and 
then extract the features and transfer it to a binary classi- 
fication problem. In particular, we choose five representa- 
tive features, including user activity, average degree, clus- 
tering coefficient, assortative coefficient and discrimination, 
defined as following: 

• User Activity (A) - it refers to the total number of 
user Wj's friends in social network, while in behavioral 
network it means the number of movies watched by Wj. 



• Average Activity of Neighbors (AN) ~ defined as ^A^,; — 
-, wliere Ti is the set of user Ui's neighbors. In 



social network, it is m^'s friends list, and it refers to all 
Mi's watched movies in the behavioral network. 



Clustering Coefficient (C) 
defined as a - S,-™'^.,-.™ 



For social networks, it is 

"" , where Uij = 1 if node i 
is connected to node j, otherwise aij =0. Thus, d, in 
social networks, expresses the likelihood for two neigh- 
bors j and m of node i are connected; for behavioral 

networks, it is defined as d — 34^7(^4-37), where Sjfc 
is the Jaccard index [?]. d, in behavioral networks, 
expresses how similar the user Mi's movies are. It also 
suggests how diverse Mi's taste is. 



Assortative Coefficient (AC) - defined as 



Ad 



: , and Ad € 



[—1, 1]. Mi is more likely to connect reversely active 
users (or popular items in behavioral networks) if Ad < 
0, and vice verse. 

• Discrimination (D) - defined as D = 1 — 

A, 

where Aj runs over all possible activities of Mi's neigh- 
bors and p{Aj) is the ratio of ylj -activities to total dif- 
ferent activities of user Mi. If Mi connects to users (or 
items) with only one kind of activity (e.g. Aj — 10), 
Di = 0. The larger D, the more discriminating of user 
Mi's neighboring activities. 

We then employ the machine learning process to evaluate 
the correlations and differences between social and behav- 
ioral networks. Given a set of users, each is described by 
the defined features (each feature is normalized) as the in- 
put variables, and labeled to indicate whether this user and 
its friends in the social network have watched randomly se- 
lected movies. We then adopt the c4-5 to be the weak clas- 
sifier and the AdaBoost [?] to form the strong classifier. To 
better investigate effects of social and behavioral features, 
we introduce a tunable parameter, a £ [0,1], where a refers 
the social impact, and 1 — a simultaneously corresponds to 
the impacts of behavioral features. In the extremal cases 
a = and a = 1, features are taken into account only from 
social and behavioral networks, respectively. Subsequently, 
we apply the decision tree algorithm to weighted training 
data and then take the classifier with the highest score as 
the strong classifier, and eventually use it to predict testing 
instances. 

4. DATA AND RESULTS 

In this paper, one representative dataset, Flixster.com, 
a social movie rating website (with five-star rating level), 
is used to evaluate the proposed method. We purify the 
data in order to remove noisy data by: (i) extracting the 
largest connected sub-graph from the social network; (ii) 
deleting the absent users from the behavioral network to 
guarantee all users to be existing in the social network, and 
we delete the movies which are never watched by the users 
in the social network. Finally, we get the social network 
with 137,372 users and 2,430,282 undirected links, and the 




Figure 1: Precision (recommendation accuracy) ver- 
sus a. Each line represents the result of one dividing 
ratio of training and testing data (denoted as r). 



behavioral network with the same number of users, 48,756 
movies, and 8,062,853 user-item-rating behavioral records. 
For the binary classifier, we divide users into two classifiers: 
we mark 1 to the users who and friends have watched ran- 
domly selected popular movies with rating larger than 3, 
and otherwise. 

Figure [T] shows the recommendation accuracy (we use Pre- 
cision in this paper) for a £ [0, 1]. It is can be seen that, for 
different ratios of training data, the optimal result reaches at 
a = 0.2. Considering that a indicates the impact of features 
of social network structure, it suggests that, for Flixter, the 
behavioral network is more significant than the social net- 
work in predicting users' potential-like movies. The optimal 
a > also indicates that the social features indeed can im- 
prove accuracy of recommendations. 

5. CONCLUSIONS AND DISCUSSION 

In this paper, we have proposed to use network structure 
based features of coupling networks, and combine a machine 
learning process to evaluate the performance of the extracted 
properties in recommendation accuracy. The experimental 
results show that the behavioral information plays more im- 
portant role in the observed data, Flixter. In addition, the 
results also shows that the social features can benefit in im- 
proving recommendations accuracy. 

This article only provides a simple start point for the de- 
sign of hybrid algorithms making use of social information, 
while a couple of issues remain open for future study. Firstly, 
we lack quantitative understanding of the structure and dy- 
namics of coupling networks. Although the relation between 
the coupling property and recommender systems is not clear 
thus far, we deem that an in-depth understanding of cou- 
pling networks should be helpful for better recommenda- 
tions; Secondly, the current algorithm only considers the ef- 
fects of five representative network-based features, and more 
meaningful features should also be taken into account, thus 
the Social Network Analysis (SNA) based techniques can 
be used to provide more substantial recommendations, and 
social predictions as well. Thirdly, the experiment in this ar- 
ticle is only performed on one dataset. For different coupling 



networks, the social impact sliouki perform quite differently, 
according to their respective underlying structure. Finally, 
this paper provides a promising way to identify the impor- 
tance of different features in driving the connectivity and 
robustness of various coupUng networks. 
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